Scene image representation by foreground, background and hybrid features

نویسندگان

چکیده

Previous methods for representing scene images based on deep learning primarily consider either the foreground or background information as discriminating clues classification task. However, also require additional (hybrid) to cope with inter-class similarity and intra-class variation problems. In this paper, we propose use hybrid features in addition represent images. We suppose that these three types of could jointly help image more accurately. To end, adopt VGG-16 architectures pre-trained ImageNet, Places, Hybrid (both ImageNet Places) datasets corresponding extraction foreground, information. All are further aggregated achieve our final representation Extensive experiments two large benchmark (MIT-67 SUN-397) show method produces state-of-the-art performance.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Object Category Recognition by Bag-of-Features Using Co-Occurrence Representation by Foreground and Background Information

متن کامل

Thesis Proposal: Learning Image Patch Representation for Detection, Recognition and Dynamic Foreground/Background Extraction

Using feature based (ie. interest image features that satisfy some metrics of geometric or photometric invariance [39, 50, 66, 74, 73]) or direct image methods (ie. derivatives or differences of image intensity patterns [2, 37, 49]) for 3D object/scene model construction [48, 97], image content retrieval [85, 14], object recognition [24, 66], video structure matching/parsing [83, 93, 92] and au...

متن کامل

Disentangling Motion, Foreground and Background Features in Videos

This paper introduces an unsupervised framework to extract semantically rich features for video representation. Inspired by how the human visual system groups objects based on motion cues, we propose a deep convolutional neural network that disentangles motion, foreground and background information. The proposed architecture consists of a 3D convolutional feature encoder for blocks of 16 frames...

متن کامل

Enhancement of Learning Based Image Matting Method with Different Background/Foreground Weights

The problem of accurate foreground estimation in images is called Image Matting. In image matting methods, a map is used as learning data, which is produced by those pixels that are definitely foreground, definitely background ,and unknown. This three-level pixel map is often referred to as a trimap, which is produced manually in alpha matte datasets. The true class of unknown pixels will be es...

متن کامل

Interaction between Attention and Bottom-Up Saliency Mediates the Representation of Foreground and Background in an Auditory Scene

The mechanism by which a complex auditory scene is parsed into coherent objects depends on poorly understood interactions between task-driven and stimulus-driven attentional processes. We illuminate these interactions in a simultaneous behavioral-neurophysiological study in which we manipulate participants' attention to different features of an auditory scene (with a regular target embedded in ...

متن کامل

ذخیره در منابع من

ذخیره در منابع من قبلا به منابع من ذحیره شده

{@ msg_add @}

با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: Expert Systems With Applications

سال: 2021

ISSN: ['1873-6793', '0957-4174']

DOI: https://doi.org/10.1016/j.eswa.2021.115285